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Recent genome sequencing studies have identified sev- 
eral classes of complex genomic rearrangements that 
appear to be derived from a single catastrophic event. 
These discoveries identify ways that genomes can be 
altered in single large jumps rather than by many in- 
cremental steps. Here we compare and contrast these 
phenomena and examine the evidence that they arise 
"all at once." We consider the impact of massive chro- 
mosomal change for the development of diseases such as 
cancer and for evolution more generally. Finally, we 
summarize current models for underlying mechanisms 
and discuss strategies for testing these models. 



The evolution of a normal cell to a cancer cell is generally 
thought of as a sequential accumulation of many in- 
dependent lesions to the genome (Nowell 1976; Greaves 
and Maley 2012; Yates and Campbell 2012). In addition 
to somatic point mutations, these lesions include copy 
number alterations, such as chromosomal deletions or du- 
plications (Fig. 1A,B), and balanced structural rearrange- 
ments, such as translocations and inversions (Mitelman 
et al. 2007; Rabbitts 2009; Stephens et al. 2009; Stratton 
et al. 2009). By chance, these somatic alterations in the 
genome will hit cancer genes — disrupting tumor suppres- 
sors or activating proto-oncogenes — affecting cells and 
causing their progeny to gradually accumulate the nec- 
essary "hallmarks of cancer" (Hanahan and Weinberg 
2011). Although there is ample evidence that cancer can 
develop through this type of gradual mutational accrual, 
there are reasons to think that there could be advantages 
if the oncogenic changes could be acquired all at once (at 
least from the vantage point of the aspiring cancer cell). 
One obvious advantage is speed, with the possibility of 
rapidly accruing a large phenotypic effect from combi- 
natorial acquisition of smaller-effect genetic changes. 
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Another potential advantage of all-at-once or "punctu- 
ated equilibrium" (Gould and Eldredge 1977) mecha- 
nisms is that intermediate steps in tumor development 
can actually be deleterious. For example, oncogene acti- 
vation can trigger stress responses, such as p53 activation, 
that can inhibit proliferation or promote cellular senes- 
cence (Lowe et al. 1994; Fearnhead et al. 1997; Kastan and 
Bartek 2004; Halazonetis et al. 2008; Jackson and Bartek 
2009). In principle, an all-at-once mutational mechanism 
could circumvent this problem by simultaneously gen- 
erating an active oncogene together with changes that 
eliminate the stress response. 

Although all-at-once massive genomic alterations might 
accelerate the acquisition of growth-promoting mutations, 
they have an obvious downside: They could also poten- 
tially generate massive deleterious effects that could 
overwhelm any growth-promoting mutations. One way 
to minimize this collateral damage would be to have 
the massive genomic change be restricted to a portion of 
the genome. Although it is common to think of mutation 
rates as numbers that apply evenly across the genome 
(Durrett 2008; Bozic et al. 2010; Tomasetti et al. 2013), 
there is a growing appreciation of localized variability in 
these rates (Hodgkinson and Eyre-Walker 201 1; Hodgkinson 
et al. 2012; Roberts et al. 2012; Schuster-Bockler and Lehner 

2012) . Because of transcription-coupled DNA repair, 
transcriptionally silent portions of the genome have ap- 
proximately threefold higher mutation rates (Fousteri and 
Mullenders 2008; Lawrence et al. 2013). Likewise, late- 
replicating regions of the genome exhibit a similar fold 
increase in mutation rates (Stamatoyannopoulos et al. 2009; 
Chen et al. 2010; Koren et al. 2012; Donley and Thayer 

2013) . Localized hypermutation has also been associated 
with chromosomal rearrangements. In budding yeast, DNA 
breaks repaired by mitotic gene conversion are accompanied 
by surprisingly high mutation rates (> 1000-fold higher than 
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Figure 1. Characteristics of simple and complex chromosomal rearrangements. (A) Copy number losses or gains can be detected by 
array-based or sequencing-based technologies from abrupt changes in the read depth signal (sequencing) or the intensity of fluorescent 
probes (array). Loss or gain generally affects only one homolog; therefore, a copy number loss also results in LOH. (5) Copy number 
alterations are often generated by simple chromosomal rearrangements, such as deletions or tandem duplications. (C) Chromothripsis 
is characterized by an alternating copy number profile with loss and retention of heterozygosity. Minus signs in C and D denote 
chromosomal segments that are inverted. Fluorescent in situ hybridization (FISH) characterization confirms that only one chromatid 
(allele B) is affected. [D] Chromoanasynthesis is characterized by a copy number profile that alternates between euploid and higher 
ploidy. In contrast to chromothripsis, chromoanasynthesis does not necessarily exhibit LOH but reflects resynthesis of segments from 
one chromatid. Chromoanasynthesis also contains frequent insertions of short sequences between the rearrangement junctions (red 
arrows) that are copied from the rearranged segments (templated insertions). Both DNA resynthesis and templated insertions are 
suggestive of replication-based mechanisms. (E) Chromoplexy is characterized by a closed chain of translocations, with little or no copy 
number alteration. 
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spontaneous mutations) in the vicinity of the break (Hicks 
et al. 2010; Malkova and Haber 2012). Similarly, localized 
regions of hypermutation ("kataegis") that often occurred at 
the site of somatic rearrangements were discovered in breast 
cancer genomes (Nik-Zainal et al. 2012; Alexandrov et al. 
2013). Even more dramatic local genomic change can be 
generated from critical telomere shortening, resulting in 
breakage-fusion-bridge (BFB) cycles (McClintock 1941) that 
occur over several cell division cycles (and hence are not 
"all-at-once" events). Analysis of genome copy number 
alterations in cancer also identified "firestorms" of local 
amplifications and deletions whose origin is uncertain but 
might originate from BFB cycles (Hicks et al. 2006). 

Recently, whole-genome sequencing has led to the 
discovery of three new classes of complex catastrophic 
chromosomal rearrangement: chromothripsis, chromo- 
anasynthesis, and chromoplexy (Fig. 1C-E; Liu et al. 2011; 
Stephens et al. 201 1 ; Rausch et al. 2012; Baca et al. 2013). 
Although the underlying mechanisms for these phenom- 
ena are unknown and likely to be different, all of these 
patterns of rearrangements appear to have originated 
from a single event, simultaneously generating tens or 
even hundreds of translocations and/or DNA copy num- 
ber changes (Meyerson and Pellman 2011; Tubio and 
Estivill 2011; Forment et al. 2012; Jones and Jallepalli 
2012; Maher and Wilson 2012). To encompass the poten- 
tially all-at-once origin of these mutational mechanisms, 
a unifying term, chromoanagenesis, has been proposed 
(Holland and Cleveland 2012). Two of these classes of 
chromosomal rearrangement, chromothripsis and chro- 
moanasynthesis, affect limited portions of the genome 
(often a single chromosome or chromosome arm), raising 
the mechanistic question of how extensive chromosomal 
rearrangements can be so highly localized. The frequency 
of these types of events in cancer remains unclear, but 
many recent studies suggest that they may be more com- 
mon than initially suspected (Malhotra et al. 2013; Yang 
et al. 2013; Zack et al. 2013). These new classes of mu- 
tagenesis are not restricted to human cancer and have 
turned up in such exotic places as the Tasmanian devil's 
transmissible facial tumor (Deakin et al. 2012) and human 
congenital diseases (Kloosterman et al. 2011a, 2012; Liu 
et al. 2011; Chiang et al. 2012; Nazaryan et al. 2013). 
Although these new classes of chromosome rearrange- 
ment have mostly been found in the context of disease, it 
seems likely that they reflect very general mechanisms 
for genome alteration that could be important for organ- 
ismal evolution. 



A new game at the genome casino 

Genome instability is not only a hallmark but also an 
enabling characteristic of cancer (Negrini et al. 2010; 
Hanahan and Weinberg 2011; Gordon et al. 2012). Dis- 
ruption of genome maintenance and elevated rates of 
somatic mutations (e.g., due to disruption of DNA repair 
machineries or increased damage) accelerate the trans- 
formation from precancerous tissue into malignant tu- 
mors (Gorgoulis et al. 2005; Halazonetis et al. 2008; 



Vogelstein et al. 2013). Despite accelerated tumor evolu- 
tion, individual mutation events are usually regarded as 
independent of each other, acquired randomly and grad- 
ually. In addition to site-specific mutation events (point 
mutations, chromosomal translocations, and copy num- 
ber variations), doubling of the genome content due to 
cytokinesis failure is a one-off event that can generate 
aneuploidy and structural alterations of chromosomes 
(Fujiwara et al. 2005; Holland and Cleveland 2009; Carter 
et al. 2012; Ganem and Pellman 2012; Zack et al. 2013). 

In studying the tumor genome of a patient with chronic 
lymphocytic leukemia (CLL), Stephens et al. (2011) ob- 
served a striking pattern of chromosomal rearrangements 
that was at odds with all known models of genomic 
alteration. The tumor from this patient contained 45 
tumor-specific translocations, 42 of which were concen- 
trated on the long arm of chromosome 4. Furthermore, 
the rearrangement events were clustered within multiple 
focal regions in chromosome 4, raising the question of 
how the damage could be so highly localized. Did this 
result from a genuinely new mechanism for genome dam- 
age, or was the genome center playing the conventional 
game of random chance but had been dealt a rare hand? 

Several lines of evidence suggested this was, in fact, a 
novel phenomenon. First, despite the large number of 
rearrangements, SNP array copy number analysis showed 
that the altered region of chromosome 4 existed in only 
two copy number states, with many transitions between 
these two states. (As in the study by Stephens et al. (201 1 ), 
we use the term copy number states because of the highly 
aneuploid nature of cancer genomes, where each of the 
parental haplotypes can be present in multiple copies.) 
This pattern of having only two copy number states is in 
sharp contrast to conventional clusters of complex rear- 
rangements (Hicks et al. 2006; Bignell et al. 2007; Zhang 
et al. 2009a). Known causes of complex rearrangements 
such as BFB cycles characteristically result in multiple 
amplified copy number states that are assumed to derive 
from events occurring over many rounds of cell division 
(McClintock 1941; Stark et al. 1989; Gisselsson et al. 2000; 
Bignell et al. 2007; Greenman et al. 2012b). 

The alternation between two copy number states was 
accompanied by loss and preservation of heterozygosity 
(Fig. 1A,C). Whole-genome sequencing further revealed 
a very complex pattern of segmental rearrangements at 
the boundaries of high copy number states, with events 
that were head to tail (deletion type), head to head/tail to 
tail (inversion type), and tail to head (tandem duplication 
type) (Fig. 2A). However, despite this chaotic pattern of 
rearrangement, loci that preserved parental copy number 
also preserved heterozygosity of the parental genotypes 
(Fig. 1C). As discussed below, this maintenance of islands 
of heterozygosity among many large regions with loss of 
heterozygosity (LOH) is difficult to explain by multiple 
independent deletions. 

A third feature of the observation was that the pattern 
of end-joining strongly suggested an origin from a DNA 
double-strand break (DSB). As highlighted in later studies 
(Kloosterman et al. 2011a, 2012; Chiang et al. 2012), the 
preponderance of rearrangements involved ligation of both 
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Intra-chromosomal 

i. Deletion type Inferred insert size from reference alignment is longer than expected. 

Alignment to the reference head " tai1 " Breakpoints si|S2,S2|S3 

S1 S2 S3 

Rearranged assembly Fusion si|S3 

S1 ^~ S3 

ii. Tandem duplication type Relative orientation from reference alignment ("tail-head") is opposite to normal pairs ("head-tail"). 
Alignment to the reference ta '' ■ > head Breakpoints S1|S2,S2|S3 

SI S2 S3 

Rearranged assembly a ^ a ^ Fusion S2|S2 

S1 S2 S2 S3 

iii. Inversion type Relative orientation from reference alignment is "tail-tail" or "head-head." 

Alignment to the reference ■ > Breakpoints si|S2,S2|S3 

Si Si_r*~ S3 

Rearranged assembly ^ a Fusion si|-S2,-S2|S3 

si » -S2 S3 

Inter-chromosomal/long-range 

Pairmates are aligned to different chromosomes or loci separate by centromeres, assembly gaps, or other rearrangements. 

S3 S4 

Alignment to the reference 



si 



S2 



Chr.1 
Chr.2 



Breakpoints si|S2,S3|S4 
Fusion si|S4 



Rearranged assembly 



B Complex rearrangement clusters 

i. Overlapping tandem-duplication and deletion signatures 

" Discordant read-pair -^■ " m " 



Rearranged -►^ .... — 
genome l c 



signatures 



ii. Inverted segments disguise the deletion signature as a tandem duplication signature 



Rearranged _ ^ 

genome " 



Discordant read-pair 
signatures 



both in the rearranged genome and according to their alignment to the reference | 
deletion of segment A and swapping of segments B and C generates two "tandem 
none of which corresponds to a simple event, (ii) Two inversions alter a deletion 
arrows). 



Figure 2. Signatures of chromosomal rearrange- 
ments from paired-end sequencing. [A) Four major 
types of chromosomal rearrangement signatures 
deduced from paired-end sequencing: deletion type, 
tandem duplication type, inversion type, and long- 
range type (including interchromosomal). Each 
type of read pair shows aberrations from proper 
pairs when aligned to the reference assembly, 
which implies a new junction in the test chromo- 
some. A deletion (read pairs shown as red arrows) 
or tandem duplication (shown as green arrows) 
only introduce one fusion and hence can be un- 
ambiguously inferred from a single type of signa- 
ture (in the absence of other overlapping events). 
Inversion events, however, introduce at least two 
rearranged junctions (shown here is a simple in- 
version); hence, the rearranged genome can only be 
constructed by considering all related events. The 
same is true for long-range events with pair mates 
residing in different chromosomes or separated by 
centromeres, assembly gaps, or other rearrange- 
ments. In general, it is necessary to incorporate 
copy number information together with rearrange- 
ment signatures to derive the structure of the 
rearranged genome (for a computational framework, 
see Greenman et al. 2012a). [B) Examples of complex 
chromosomal rearrangements exhibiting multiple 
overlapping rearrangement signatures that do not 
reflect the same events as the "typing. " The discor- 
dant read pair signatures (color arrows) are shown 
;enome. (i) A complex rearrangement resulting in the 
duplication 77 and one "deletion 77 read pair signatures, 
signature into a tandem duplication signature (green 



ends of a single DNA break to other genomic segments 
(ends being operationally defined as sequences within ~1 
kb of each other,- hereafter referred to as a pairing of ends). 
This contrasted with complex genomic rearrangements 
that arise by replication-based mechanisms (Lee et al. 
2007; Carvalho et al. 2009, 2011) where the rearrange- 
ment is often seen only on one side of the initial lesion 
(Haber and Debatisse 2006; see below for more details). 

The final defining feature of this new mutagenesis 
event was the nonrandom, highly localized clustering 
of chromosomal breakpoints. These four hallmarks — 
alternating copy number profiles, loss and preservation 
of heterozygosity, pairing of the breakpoint ends, and 
clustering of rearrangements — define the class of chro- 
mosomal alteration termed chromothripsis (Greek for 
shattering of chromosomes). 

All at once? 

The defining hallmarks of chromothripsis cannot be 
explained by a gradual accumulation of independent 
events. First, truly independent chromosomal rearrange- 
ments are expected to be randomly distributed across the 
genome rather than clustered in focal regions (Beroukhim 
et al. 2007, 2010). Second, the gradual accumulation of 
such a large number of overlapping rearrangements, both 
copy number neutral and copy number changing, would 



almost certainly produce a final outcome with more than 
two copy number states. Finally, maintenance of islands 
of heterozygosity interspersed between regions that have 
lost heterozygosity is hard to explain by multiple sequen- 
tial events. This is illustrated by the fact that a deletion 
early in the process will remove sequences that cannot 
be regained. Thus, if the observed pattern resulted from 
sequential mutation events, deletions would need to be 
restricted to a later time period, when they could not 
interrupt the acquisition of balanced rearrangements — a 
complex "programmed" event for which it is hard to 
envision a mechanism (one example is shown in Fig. 3). 

In contrast to the gradual model, Stephens et al. (2011) 
proposed a much more compelling catastrophic model of 
chromothripsis that offers a parsimonious explanation of 
the observation: One chromosome (or regions from one or 
more chromosome arms) is shattered into segments of dif- 
ferent lengths in a single catastrophic event; the segments 
are then stitched together, most likely by nonhomolo- 
gous end- joining, resulting in a derivative chromosome 
containing a subset of the shattered segments in random 
orientations. The retained segments maintain the paren- 
tal copy number and heterozygosity, yielding the high 
copy number state. The segments that are not incorpo- 
rated into the derivative chromosome are either lost, 
yielding the low copy number state, or incorporated into 
a double-minute chromosome, as discussed below. 
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Figure 3. Reconstruction of a local complex 
rearrangement and its mutational history 
from read pair signatures and copy number 
information. (A) Copy number alterations can 
be detected by array methods or read depths 
in sequencing data; rearrangement signatures 
can be detected from discordant read pairs in 
B Reassembled sequencing data. [B] By combining copy num- 
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come in B is often not unique. However, all 
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and generate three new junctions (L|C, C|B, 
and B|R). A catastrophe model can easily 
solve this puzzle because all of the breaks 
and translocations occur all at once. By com- 
parison, a solution from the gradual model of 
simple events requires at least three simple 
events to generate the final configuration (one 
for each new junction). As each event causes 
two breakpoints, three events need six break- 
points, which requires two additional breaks 
in addition to the four original junctions to be 
broken. These two additional breaks arise 
from either within the lost A segment (lost history) or hitting the same junction multiple times (recurrent breaks). Shown here is one 
possible solution that involves recurrent breaks at both B|C and C|R junctions. Both lost history and recurrent breaks are highly unlikely. 
As the number of rearranged fragments increases, the need to include these types of unlikely events further increases, strongly arguing 
against the gradual model. 
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Historiography of the cancer genome 

These general arguments make intuitive sense but ideally 
would be backed up by a detailed molecular assembly of 
the evolutionary path taken to generate the chromothrip- 
tic chromosome. Stephens et al. (2011) faced the obvious 
challenge of inferring prior history from a terminal snap- 
shot of the present, which is already difficult for cancer 
genomes with simple rearrangement patterns (Greenman 
et al. 2012a) and quickly becomes impractical for a com- 
plex case such as chromothripsis (Yates and Campbell 
2012). 

The first complication for inferring an evolutionary 
trajectory is purifying selection: The mature tumor is 
biased to preserve genomic changes that are beneficial to 
tumor development and to have lost deleterious changes 
(Beroukhim et al. 2007; Greenman et al. 2007; Bignell 
et al. 2010). By viewing only the endpoint of selection, we 
could miss intermediate steps in the development of the 
rearranged chromosome whose signatures were lost during 
tumor evolution. An additional technical complication 
for solving the genomic Rubik's cube for the chromothrip- 
tic chromosome is that one cannot interpret paired-end 
sequences in a simple straightforward way when they oc- 
cur in the context of other overlapping rearrangements. 
For example, a discordant pair of sequencing reads that is 
inferred to be a "tandem duplication type" event when 
directly mapped onto the reference genome sequence 
(Fig. 2A ; Raphael et al. 2003; Tuzun et al. 2005; Korbel 



et al. 2007; Campbell et al. 2008; Medvedev et al. 2009) 
has a completely different interpretation under a rear- 
ranged genome when additional rearrangement events 
have occurred between the pair mates (Fig. 2B). This 
technical issue is less problematic for simple genomes 
where rearrangements seldom overlap and can be de- 
coupled and interpreted unambiguously (Greenman et al. 
2012a,b; Quinlan and Hall 2012). However, it becomes 
prohibitively complex in chromothripsis, where tens or 
hundreds of rearrangements overlap, resulting in an enor- 
mous number of possible permutations. This provides an 
important cautionary note for aspiring genome analysts: 
An accurate interpretation of a highly rearranged cancer 
genome from paired-end sequencing data requires an 
integrative analysis of rearrangement signatures (such 
as deletion type or tandem duplication type) (Fig. 2A) and 
copy number data. In other words, context matters. 

The following back-of-the-envelope calculations il- 
lustrate the magnitude of the problem. The number of 
different outcomes created by random permutations of 10 
overlapping rearrangements is 10! = 3,628,800 and the 
number grows to ~10 64 for 50 rearrangements. Typical 
chromothripsis harbors tens or even hundreds of over- 
lapping rearrangements, and it is clearly impossible to 
enumerate all possible outcomes. Additional complica- 
tions are caused by intermediate events being obscured 
by later events,- e.g., a later deletion event that removes 
the signature of an earlier translocation. This estimate 
also does not account for rearrangements that are not 
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detectable due to technical issues such as uncertainties in 
the reference genome assembly or inaccuracies in short- 
read alignment. 

The most difficult challenge stemming from the vast 
number of outcomes due to permutation complexity, 
however, is finding not just one solution, but the most 
likely solution based on a certain background statistical 
model of tumor evolution. Ultimately, we want to know 
the likelihood of the multievent solution to chromothrip- 
sis based on the background model of tumor evolution to 
dismiss the gradual model in favor of a single catastrophe. 
(Fig. 3 illustrates this problem by a simple example.) To 
answer this last question without enumerating all possi- 
ble outcomes of a gradual model, Stephens et al. (2011) 
focused on the number of copy number states during 
genome evolution, essentially reducing the multidimen- 
sional space of chromosome configurations to a single 
dimension of "copy number." They set up simulations in 
which individual rearrangement events were applied to 
the genome sequentially, starting from the reference se- 
quence. Instead of brute force enumeration of all genomic 
configurations, the eventual number of copy number 
states was scored, making a testable prediction about 
the structure of the chromothriptic chromosome without 
the need for a complete molecular reconstruction of the 
rearranged chromosome. The gradual model rapidly gen- 
erated multiple high copy number states because gradual 
accumulation allows for overlapping tandem duplications 
to affect the same region multiple times. The discrepancy 
between the large copy number complexity implied by the 
gradual model and the hallmark two copy number states 
observed in chromothripsis provided a strong statistical 
argument against a gradual model for chromothripsis. It is 
worth noting that chromothriptic chromosomes can 
sometimes exhibit multiple copy number states (Stephens 
et al. 2011; Rausch et al. 2012). One explanation is the 
overlap between chromothripsis and other independently 
generated copy number gains,- another possibility is that 
chromothripsis that occurred during or after DNA repli- 
cation can result in a shattering and assembly of both 
sister chromatids to generate copy number gains. In either 
scenario, however, the low copy number states (three or 
four) are still in stark contrast to the rapid accumulation of 
multiple copy number states, as the gradual model suggests. 

The focal pattern of breakpoints in chromothripsis 
provides yet another argument against the gradual model. 
Independent rearrangements are generally assumed to be 
randomly distributed across the genome. The large size of 
the human genome implies that it is extremely unlikely 
to see two events that are very close to each other (~1 kb 
apart). The likelihood of seeing tens or hundreds of break- 
points within one chromosome (or chromosome arm) 
with few events elsewhere in the genome is vanishingly 
small if these events were independent and not due to 
specific targeted mechanisms. This statistical argument 
suggests that clustered rearrangements in chromothripsis 
very likely occurred simultaneously. A generalization of 
this statistical argument into a systematic way of in- 
ferring co-occurring translocations ("chained" transloca- 
tions) was recently proposed (Baca et al. 2013). This new 



approach, discussed further below, potentially provides a 
complementary method to validate the idea that chro- 
mothriptic lesions occurred at the same time. 

In summary, chromothripsis is best explained by a cat- 
astrophic all-at-once model and cannot be reconciled 
with the gradual model. The catastrophic model yields 
a simple explanation of the low number of observed copy 
number states. It also provides an explanation for the 
many overlapping tandem duplication type events that do 
not increase copy number (see Fig. 2B, i, for an example): 
These duplication type events are instead explained by 
the random orientations at which the segments are stitched 
together. Similar explanations can apply to deletion and 
other "type" events (Fig. 2B). 

When is it chromothripsis? 

The discovery of chromothripsis in cancer immediately 
raises the question of how frequently it occurs. After the 
initial discovery, there have been a plethora of studies 
confirming that chromothripsis is indeed a widespread 
and previously unappreciated phenomenon in both pri- 
mary tumors (Kloosterman et al. 201 lb; Magrangeas et al. 
2011; Berger et al. 2012; Rausch et al. 2012) and cancer 
cell lines such as HeLa (Adey et al. 2013; Landry et al. 
2013). Many cancer genomes reported in earlier studies 
also exhibited features that resemble chromothripsis 
(Stephens et al. 2009; Campbell et al. 2010). So how are 
we to recognize chromothripsis? How many rearrange- 
ments "make" the definition, and how focal is focal? 
Because copy number data are the most available, a com- 
mon sense approach was to adopt operational definitions 
that would require, say, 10, 20, or 50 oscillating copy 
number changes (Magrangeas et al. 2011). In some cases, 
corrections were applied to normalize the number of focal 
events to the general frequency of events on that chro- 
mosome (Kim et al. 2013). 

Although an operational criterion is reasonable for a 
first-pass screening and many instances with multiple 
oscillations between two copy number states likely are 
bona fide chromothripsis, the field is moving away from 
operational definitions and adopting criteria that incor- 
porate information from paired-end sequencing. Through 
more precise analysis of the structure of the derivative 
chromosome, one can better distinguish chromothripsis 
from a large number of progressively acquired rearrange- 
ments in tumors with high levels of genetic instability. 
Because an excellent review by Korbel and Campbell 
(2013) has recently covered this topic in depth, we only 
briefly review the main elements of these criteria here. 

As discussed above, one hallmark of chromothripsis is 
the clustering of rearrangements. The degree of clustering 
can be assessed by simple statistical methods because the 
null hypothesis of random breakpoints predicts that the 
distance between breakpoints should be distributed ex- 
ponentially (Beroukhim et al. 2010; Malhotra et al. 2013). 
However, it is important to note that clustering of break- 
points can also be generated by other processes involving 
multiple steps over several cell division cycles. For exam- 
ple, clustering can result from BFB cycles (Gisselsson et al. 
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2000), chromosome fragile sites (Debatisse et al. 2012), and 
strong positive selection for local oncogene amplifications 
(Bignell et al. 2007). Thus, clustering by itself is a necessary 
but not sufficient criterion for chromothripsis. An addi- 
tional criterion is provided by the fact that chromothripsis 
is characterized by regions with interspersed retention and 
LOH, which can be verified from genotype information 
from either SNP array or sequencing. Finally, if restricted 
to a single chromosome, the regions of rearrangement and 
segment loss should be confined to a single parental 
haplotype because the shattering and stitching back 
together occurs (mostly) on a single chromatid. This can 
be directly verified by cytogenetic methods, such as 
fluorescence in situ hybridization (FISH), or inferred from 
phased haplotypes in trio family studies (Browning and 
Browning 2011). 

Additional information from sequencing can evaluate 
the model that a chromosome has been shattered and 
randomly stitched together in more detail. First, the 
model predicts that the order of the fragments on the 
derivative chromosome would be random or near random 
with respect to their positions on the reference genome. 
Second, if reassembly of the derivative chromosome is 
random, the orientations of fragments on the derivative 
chromosome should also be random. This means that 
there should be no bias for fragments to be joined head to 
tail (deletion type), tail to head (tandem duplication 
type), head to head, or tail to tail (inversion type). Such 
randomness in the orientation of fragments contrasts 
with, for example, BFB cycles, which are characterized 
by inverted segments biased to be head to head or tail to 
tail (Bignell et al. 2007). Finally, if the shattering and 
restitching indeed happen all at once, then each rear- 
ranged fragment will have a unique head and tail sequence 
(with respect to the reference sequence). Thus, when 
"walking" down a chromothriptic chromosome, there 
will be an invariant alternation between head and tail 
(or tail and head) sequences, reflecting a linear connection 
(Korbel and Campbell 2013). This contrasts with the 
prediction for a mechanism involving sequential acqui- 
sition of rearrangements, particularly those involving 
amplifications. In this latter case, rearranged segments 
could be "reused" in ways that would generate heads 
followed by heads and tails followed by tails when ampli- 
fications are nested. 

Clearly, within a short period of time, the field has 
advanced far beyond "know it when you see it" opera- 
tional definitions for chromothripsis. The above-delin- 
eated criteria each address a different hallmark feature of 
chromothripsis and can be subjected to rigorous statisti- 
cal evaluation. Nevertheless, challenges remain. How 
exactly to combine multiple criteria and weigh them 
properly to make the most probable case for or against 
chromothripsis needs further development. There are also 
numerous technical challenges. Short sequence reads can 
only be aligned to the mappable regions of the genome, and 
the interesting question of what happens within highly 
repetitive sequences remains to be resolved. Complica- 
tions also come from other factors, such as inaccuracies 
in short-read alignments, the obscuring effect of other 



structural rearrangements (e.g., simple deletions/ampli- 
fications/translocations) layered on top of chromothrip- 
sis, tumor heterogeneity, and aneuploidy. All of these 
technical factors may lead to an underestimate of the 
frequency of chromothripsis. 

How frequently does chromothripsis happen? 

So what, then, is our best guess for the frequency of 
chromothripsis? Unfortunately, the largest volume of can- 
cer genomic data (such as in The Cancer Genome Atlas 
[TCGA] project) is array-based and does not have the power 
to provide an accurate estimate of the overall frequency of 
chromothripsis. However, it is becoming evident that the 
frequency of chromothripsis tends to vary between dif- 
ferent tumor tissue types. In a survey of whole-genome 
sequencing data for 64 samples of seven tumor types from 
TCGA, Malhotra et al. (2013) found no chromothriptic 
rearrangements in basal-like breast cancers, colon adeno- 
carcinomas, ovarian cancers, or renal adenomas but found 
rearrangements consistent with chromothripsis in two of 
six (33%) lung adenocarcinomas, two of 13 (15%) lung 
squamous cell carcinomas, and 39% of glioblastomas (n = 
18). In a recent survey of 4934 cancers from TCGA, Zack 
et al. (2013) employed multiple criteria to model chromo- 
thripsis by SNP array copy number alone and suggested 
that chromothripsis occurred in 5% of all samples, with 
frequencies ranging from 0% in head and neck squamous 
carcinoma to a maximum of 16% in glioblastoma. In ad- 
dition, chromothripsis has been observed in colorectal 
cancer (Kloosterman et al. 2011b), melanoma (Berger 
et al. 2012; Hammond et al. 2013; Hirsch et al. 2013), 
small-cell lung cancer (Iwakawa et al. 2013), prostate 
cancer (Wu et al. 2012), and a variety of brain tumors 
(Molenaar et al. 2012; Northcott et al. 2012; Brastianos 
et al. 2013) and blood malignancies (MacKinnon and 
Campbell 2013; Morin et al. 2013; Nagel et al. 2013). It 
is interesting to point out that the rate of chromothripsis 
seems to be unrelated to the overall rate of somatic copy 
number alterations (SCNAs) (Zack et al. 2013). 

The high frequency of chromothripsis in certain tumor 
types suggests that chromothripsis depends on the ge- 
netic and environmental background of cancers. Among 
the most frequently mutated genes in human cancer is 
TP 53 (Olivier et al. 2010), which plays central roles in 
maintaining genome stability (Zhu et al. 2002; Meek 
2004; Thompson and Compton 2010). The correlation 
between TP53 mutation and chromothripsis was first 
demonstrated in Sonic Hedgehog (SHH) medulloblastoma. 
Rausch et al. (2012) applied high- throughput whole-genome 
sequencing and long-fragment mate-pair sequencing to 
characterize genomic rearrangements in four patients of 
SHH medulloblastoma who also had germline-inactivat- 
ing TP 53 mutations (Li-Fraumeni syndrome). Sequencing 
of each of these tumors not only confirmed clustering of 
chromosomal breaks indicative of chromothripsis but 
also revealed high-level amplifications due to double-minute 
chromosomes that originated from the shattered frag- 
ments. Thus, TP 53 mutations are permissive or condu- 
cive for either the initiating chromothriptic event, the 
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selective advantage of cells bearing the rearranged chro- 
mosome, or both. Rausch et al. (2012) further identified 
strong correlations between TP53 mutation and chromo- 
thripsis in acute myeloid leukemia, demonstrating that 
the effect of TP53 mutation can manifest in multiple 
tissue types. 

How accurate are the estimates of the frequency of 
chromothripsis? Although there could be some overesti- 
mation due to ambiguities in the interpretation of geno- 
mic data, we must also consider underestimation due to 
the fact that the overwhelming majority of chromothrip- 
tic events are almost certainly lost because of deleterious 
fitness effects. Negatively selected events in normal or 
premalignant tissues could be detected only at a single- 
cell level immediately after the chromothriptic rear- 
rangement occurs, which is currently not feasible on the 
appropriate scale. Ultimately, understanding the mecha- 
nisms that generate chromothripsis will be integral to 
estimating its frequency. If the underlying cellular defects 
could lead to, but not cause, chromothripsis every time 
they occur, then the observed cases of chromothripsis 
only represent the tip of the iceberg for less dramatic but 
possibly more common mutational events. 

Does chromothripsis cause cancer? 

In general, we expect that the disruption of chromosomal 
integrity in a catastrophe such as chromothripsis will be 
negatively selected or even trigger apoptosis. However, 
this does not exclude the possibility that rare catas- 
trophes could promote cellular transformation, which 
could happen in at least two ways through copy number 
alterations. First, the massive DNA loss can include one 
or more tumor suppressor genes (Stephens et al. 2011; 
Yang et al. 2013). In their survey of the TCGA SNP array 
data, Zack et al. (2013) found that 72% of chromothripsis 
events resulted in SCNAs in regions that are frequently 
disrupted in cancer, with the most dramatic case in 
glioblastoma showing that 20 out of 22 samples with 
chromothripsis had one event disrupting the CDKN2A 
locus. Second, and perhaps most importantly, shattered 
chromosomal segments not incorporated into the deriva- 
tive chromosome can be linked to form a double-minute 
(circular) chromosome. These double minutes are readily 
amplified, which commonly occurs if they harbor an 
oncogene. An illustration of this comes from the original 
reports on chromothripsis (Stephens et al. 2011; Rausch 
et al. 2012). Stephens et al. (2011) described a small-cell 
lung cancer cell line where a chromothriptic event gen- 
erated a derivative chromosome 8 and a double minute 
containing MYC and other segments of chromosome 8. 
Most notably, the sequences on the derivative chromo- 
some and the double minute were exclusive, providing 
compelling evidence that early in the development of this 
tumor, chromosome 8 was shattered, with some bits 
being contemporaneously incorporated into the deriva- 
tive and others going into the double minute. 

Having recognized the association between chromo- 
thripsis, double minutes, and cancer-causing mutations, 
one can infer chromothripsis as the most likely underly- 



ing mechanism in the formation of high-level focal 
amplifications that incorporate multiple disconnected 
segments (Iwakawa et al. 2013; Sanborn et al. 2013). The 
multiple focal amplifications can be either extrachromo- 
somal (double minutes) or intra chromosomal (homoge- 
neously staining regions [HSRs]) due to chromosomal 
integration of double minutes (Storlazzi et al. 2010; 
Gibaud et al. 2013). Indeed, data linking the formation 
of double minutes to the formation of derivative chro- 
mosomes containing complex rearrangements were 
known prior to the first description of chromothripsis. 
Gibaud et al. (2010) characterized five different extra- 
chromosomal amplifications of double-minute chromo- 
somes in a glioma sample: One double-minute chromo- 
some consisted of multiple segments from chromosome 5 
and chromosome 9, which, in retrospect, was likely to 
have resulted from chromothripsis. FISH analysis verified 
that the extrachromosomal amplicon sequences were not 
present in the derivative chromosomes derived from 
chromosome 5 and chromosome 9. We further note that 
copy number profiles of both chromosomes showed the 
hallmark copy number patterns suggestive of chromo- 
thripsis. In addition, the rearrangements also created 
oncogenic lesions. The double minute contained the 
TERT (telomere reverse transcriptase) gene from a frag- 
ment in chromosome 5. TERT is amplified or overex- 
pressed in many cancer types (Beroukhim et al. 2010). The 
functional significance of TERT overexpression is re- 
markably underscored by the recent discovery that 
—80% of glioblastomas harbored mutations in the 5' 
flanking region of TERT that activate its expression 
(Horn et al. 2013; Huang et al. 2013; Killela et al. 2013). 
Interestingly, the glioma sample studied by Gibaud et al. 
(2010) also had deletions encompassing the CDKN2A 
locus on the chromosome 9 derivatives. Thus, one 
episode of chromothripsis appears to have generated 
two critical oncogenic lesions (TERT amplification and 
CDKN2A deletion). 

Besides copy number alterations, translocations in 
chromothripsis can also lead to gene fusions or dysregu- 
lated gene expression. Mehine et al. (2013) suggested that 
chromothripsis is a frequent driver event in uterine 
leiomyomas, resulting in increased expression of the trans- 
located HMGA1 and HMGA2 genes. However, the proba- 
bility of generating in-frame gene fusions — the major class of 
oncogenic translocations — through random chromosomal 
translocations is generally small (Stephens et al. 2009), and 
the observed frequency of in-frame fusions due to chromo- 
thripsis is comparable with those resulting from simple 
rearrangement events (Stephens et al. 2011). Nonetheless, 
although each rearranged segment in chromothripsis has 
a small probability of generating a functionally important 
fusion gene (Teles Alves et al. 2013), it is reasonable to 
speculate that chromothripsis, because it generates many 
rearrangements, will increase the frequency of such fusions. 
This is not because chromothripsis has any unique ability 
to generate gene fusions but simply reflects the fact that 
in chromothripsis, more shots are taken on goal. 

Although there is good reason to think that chromo- 
thripsis can generate oncogenic alterations and potentially 
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multiple mutations at one fell swoop, we are just starting to 
understand the impact of chromothripsis on cancer patho- 
genesis or prognosis. Recent studies have shown that 
chromothripsis can independently arise in either the pri- 
mary tumor or its metastasis (Kloosterman et al. 201 lb) and 
does not necessarily predict resistance to drug treatment 
(Bassaganyas et al. 2013). Even though chromothripsis 
has been suggested as a poor prognostic factor (Molenaar 
et al. 2012; Hirsch et al. 2013), direct experimental 
characterization of chromothripsis-derived mutations 
is lacking. 

One important question is when chromothripsis occurs 
during tumor development: Can it be an early driving 
event, or is it just another consequence of genomic chaos 
in late stage cancers? The latter possibility is intuitive, 
but there is also interesting evidence for chromothripsis 
occurring in the absence of massive genetic instability. 
First, allele frequencies of chromothriptic rearrange- 
ments suggest that chromothripsis generally occurs ear- 
lier than simple rearrangements (Malhotra et al. 2013). 
Moreover, the high frequency of chromothripsis identi- 
fied in tumors with low to moderate mutation rates 
(neuroblastoma, medulloblastoma, glioblastoma, and 
blood and bone marrow malignancies) suggests that chro- 
mothripsis does not, in any obligatory way, require a back- 
ground of elevated genomic instability. On the other hand, 
tumor types with frequent copy number alterations, such 
as ovarian or lung cancers, exhibit a larger number of 
chromosomal alterations, but they are often indepen- 
dently acquired rather than correlated as in chromothrip- 
sis (Malhotra et al. 2013; Zack et al. 2013). Additionally, 
as described in more detail below, de novo complex 
rearrangements that resemble chromothripsis have also 
been observed in patients with congenital disorders. So 
far, there has been no evidence that these patients have 
a cancer predisposition or defects in the maintenance of 
genome stability like patients with germline mutations 
in the TP 53 or BRCA genes. Together, these findings imply 
that chromothripsis can originate in cells with normal or 
near-normal genetic stability. 

Chromothripsis in congenital disorders 

Strong selection during embryonic development en- 
sures that only mild genomic alterations can be tolerated 
through development. It was therefore very surprising 
when recent studies reported massive constitutional 
chromosomal rearrangements similar to those of chro- 
mothripsis in cancer in some patients with congenital 
developmental disorders (Kloosterman et al. 2011a, 2012; 
Chiang et al. 2012; Nazaryan et al. 2013). In comparison 
with chromothripsis in cancer, these instances of appar- 
ent germline chromothripsis showed minimal DNA loss, 
presumably reflecting the selection against haploinsuffi- 
ciency or complete loss of gene function due to mono- 
allelic gene expression. Array-based copy number analysis, 
such as array comparative genome hybridization (aCGH) 
or SNP arrays, cannot, as expected, detect these copy- 
neutral chromosomal alterations, necessitating whole- 
genome sequencing. 



Chromoanasynthesis 

A different pattern of constitutional complex genome 
rearrangements was reported by Liu et al. (2011) in their 
investigation of patients with congenital developmental 
defects. The investigators combined aCGH and FISH to 
characterize copy number alterations and chromosomal 
rearrangements in these patients and sequenced the PCR 
products at rearrangement junctions to resolve break- 
point sequences. In contrast to chromothripsis, Liu et al. 
(2011) observed frequent copy number gains with inter- 
spersed regions containing duplication or triplication of 
one parental allele but no LOH (Fig. ID). Breakpoint junc- 
tion sequences exhibited microhomology but also in- 
cluded templated sequence insertions (100-1000 base 
pairs [bp]) on the rearranged chromosome. The short 
template insertions suggested a replication-based mech- 
anism for incorporating these segments during the repair 
of DNA DSBs (Lee et al. 2007; Hastings et al. 2009). The 
observation of microhomology further led Liu et al. (201 1 ) 
to propose microhomology-mediated breakage-induced 
repair (MMBIR) as the major mechanism for generating 
the observed translocations (Carvalho et al. 2009; Zhang 
et al. 2009b; Liu et al. 2012). In this model, the extra allele 
copies are incorporated due to template-switching events 
in DNA replication instead of ligation (via end-joining) 
of shattered fragments. To emphasize the difference in 
mechanisms, Liu et al. (2011) termed this phenomenon 
chromoanasynthesis (synthesis of new chromosomes). 
They also suggested that the same replicative mechanism 
may play a role in chromothripsis due to similar features 
in both phenomena, such as localized clusters of rear- 
rangements and copy number alternations. 

One of the hallmark features of chromothripsis is 
interspersed loss and preservation of heterozygosity, which 
suggests a "shatter-and-stitch" process of one chromatid. 
This is in sharp contrast to chromoanasynthesis, which 
reflects extra synthesis of DNA segments. Although extra 
synthesis can lead to alternating DNA copy numbers, it 
does not necessarily result in LOH: LOH occurs by DNA 
deletion or gene conversion. Another difference between 
chromoanasynthesis and chromothripsis is that adjacent 
chromosomal ends in chromothripsis (especially in the 
germline) often result from one DNA DSB, whereas seg- 
mental breaks in chromoanasynthesis are separated by 
longer distances (0.01 — 1 Mb) and appear to result from 
template-switching events. Although replication-based 
DNA repair can contribute to chromothripsis, as evi- 
denced from examples of templated insertion sequences 
(Chiang et al. 2012), it is difficult to explain the proximity 
of adjacent chromosomal breaks based purely on template 
switching without DNA DSBs. Finally, chromoanasyn- 
thesis results in one aberrant chromosome with duplicated 
segments due to extra DNA synthesis,- in contrast, chro- 
mothripsis generally gives rise to one or multiple deriva- 
tive chromosomes with either massive DNA loss (in 
cancer) or no apparent DNA copy number alterations (in 
the germline). Based on these differential characteristics, 
we agree with the conclusion that chromoanasynthesis is 
a distinct phenomenon from chromothripsis (Holland and 
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Cleveland 2012; Liu et al. 2012; Kloosterman and 
Cuppen 2013). 

The distinction between chromothripsis and chromo- 
anasynthesis is also evident in a more recent report of 
germline genomic rearrangements. Kloosterman et al. 
(2012) identified eight cases that resemble chromothrip- 
sis, exhibiting random stitching of segments from multi- 
ple chromosomes with little DNA loss, and two cases 
with apparent gain or deletion of DNA segments that are 
suggestive of replication errors. One interesting observa- 
tion is that the rearranged alleles in both chromothripsis 
and chromoanasynthesis are preferentially of paternal 
origin,- this was attributed to errors that result from the 
higher number of mitotic divisions in gametogenesis in 
males relative to females (Kloosterman and Cuppen 2013). 
In contrast, most human germline aneuploidies are ma- 
ternally derived (Handyside et al. 2012; Nagaoka et al. 
2012), and de novo chromosomal instability in early em- 
bryonic development does not show a preferred origin 
(Vanneste et al. 2009; Voet et al. 2011). 

Chromoplexy 

Chromothripsis and chromoanasynthesis are two exam- 
ples of complex genomic rearrangement triggered or 
generated by a single event. A single catastrophic event 
was also invoked to explain a recently described pattern 
of rearrangement found in prostate cancers (Fig. IE; 
Berger et al. 201 1; Baca et al. 2013). Termed chromoplexy 
(Greek for braid of chromosomes), this event is charac- 
terized by "chains" of translocations involving multiple 
chromosomes — up to eight so far described. The sequence 



signatures of chromoplexy suggest that all of the partner 
segments involved in the translocations originate from 
DNA DSBs, with both ends of each break contributing to 
the daisy chain cycle of translocations (Fig. 4A). This 
organization of breakpoint ends bears some similarity to 
the pairing of the ends described above in chromothripsis. 
As with chromothripsis, the pairing pattern strongly 
implies that chromoplexy is another example of an all- 
at-once mechanism for chromosomal rearrangement: If 
the breakpoint ends originate from DSBs and are joined 
into an integrated pattern of translocations, then the breaks 
must have occurred simultaneously. 

To put the intuitive argument into a more rigorous 
framework, Baca et al. (2013) developed an algorithm 
called ChainFinder to identify the chained translocations 
of chromoplexy. The assumption is that the probability 
that the observed translocations of two adjacent break- 
point ends occurred simultaneously is related to the 
probability that two breakpoint ends originate from a 
single DSB. The inference of a DSB is not straightforward 
because the breakpoint ends originating from a single 
DSB due to resection and other processing are not pre- 
cisely connected in the reference sequence (see Fig. 4B for 
an illustration); a null model for independently generated 
breaks is therefore necessary for a statistical inference. To 
create the null model of independently generated chro- 
mosomal breaks, ChainFinder first estimates the density 
of chromosomal breaks at any locus by averaging over 
multiple tumor samples and then multiplying this by the 
total number of chromosomal breaks detected in a par- 
ticular sample. This generates a sample-specific, locus- 
dependent frequency of breaks. Based on the null model 
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of local breakage frequency, ChainFinder calculates the 
probability that two adjacent translocation ends have 
originated from independent breaks (Fig. 4B). This prob- 
ability monotonically decreases with the separation be- 
tween the two ends and becomes vanishingly small 
when the two ends are very close (100—1000 bp), enabling 
a rejection of the null model that the two translocations 
occurred independently. ChainFinder then searches for 
all "cycles" of rearrangements (closed chains of rearrange- 
ments) and tests the null model for every two pairs of 
adjacent breakpoints: If both pairs of ends have a rea- 
sonable chance of having originated from independent 
breaks, then these two pairs of ends can arise from two 
independent events, and the original cycle is reduced to 
two smaller cycles. Chromoplexy events are therefore 
defined as chains of translocations that cannot be further 
reduced. 

The ChainFinder algorithm is conceptually different 
from and potentially complementary to the Monte Carlo 
simulations used by Stephens et al. (2011) to infer that 
complex rearrangements have resulted from a single 
catastrophic event. The Monte Carlo simulations moni- 
tor DNA segment copy number as a surrogate, enabling 
rejection of the gradual model without directly testing 
the all-at-once model. In contrast, ChainFinder directly 
examines the interdependency of rearrangements and 
thus gets closer to a direct test of the all-at-once hypoth- 
esis. The complex patterns of rearrangements in chromo- 
thripsis are almost certainly "chained"; it is thus possible 
to apply ChainFinder to deconvolute chromothripsis and 



chromoplexy from other independent rearrangements 
that occur on the same chromosome or its homologous 
counterpart. Malhotra et al. (2013) adopted a similar 
heuristic idea to search for clusters of translocations to 
infer chromothripsis. A more rigorous generalization of 
the idea of "two simultaneous translocations due to one 
DSB" as judged from the spatial proximity of breakpoint 
ends could be a powerful method to deconvolute the 
evolution history of rearranged genomes. 

The following example illustrates the potential for a 
more general application of this type of approach to re- 
solve complex rearrangements into smaller chains and 
decipher the history of events during tumor evolution. In 
an early report of germline chromothripsis (Kloosterman 
et al. 201 la), we noticed that the complex rearrangements 
of multiple chromosomal segments in three chromo- 
somes (1,4, and 10) can possibly be reduced to two cycles: 
the first cycle containing three DSBs originating from 
each chromosome (Fig. 5A,B) and the second containing 
only DSBs in chromosome 4 and chromosome 10 (Fig. 
5B,C). Because the pairs of adjacent breakpoint ends are 
close to each other (<1000 bp), the translocations are 
almost certainly chained together in each cycle. Interest- 
ingly, if the first chain of three translocations occurred 
all at once, the result would have been a functional 
centromere-containing derivative chromosome, t(4*,10q), 
which is retained in the final chromosomal configuration, 
plus two mitotically unstable chromosomes: a dicentric 
chromosome, t(10*,l*), and an acentric chromosome 
fragment, t(lp,4q) (Fig. 5B). The two mitotically unstable 



Reference assembly of segments 

1A , IB* 



1-50,761,463 ' 50,761, 470-end 

4A* . 4B 



1-102,287,791 



105,745,828-end 



B 



Chromoplexy 



1-57,523,805-end 



4A*~10B 
10 A*- IB* 
1A--4B 



der(4) 



10A* 

t(10*,1*) ( D CI 

dicentric 



t(1,4) 
acentric 



|l 0A*| » 1 0a . 1 0b « 1 0c - fj Od -1 B*| 

1A 



|lA-4a| . 4b • 4c • 4d • 4e • 4f «|4b] 
Chromothripsis 



B|.-4d «|l0d -Tb*] 



der(10) 



|l 0A*| . 4b . -1 0b • 4c • -1 0c • 4f . 4e • 1 0a • - |4a — Ta] 




Figure 5. Possible decomposition of rearrangements 
reported by Kloosterman et al. (2011a) into two chains. 
The first chain is plausibly a chromoplexy event that 
swaps partners between six chromosomal arms, shown 
in A. Asterisks denote chromosomal segments contain- 
ing a centromere. Newly formed junctions are shown 
as red links. This chromoplexy episode would create 
one functional (single centromere-containing) deriva- 
tive chromosome, t(4p*, 10q), plus two aberrant ones, 
t(10p*,lq*) (dicentric) and t(lp,4q) (acentric), as shown 
in B. The unstable dicentric and acentric chromosomes 
could then have triggered the remaining rearrange- 
ments shown in C, which are interlinked and resemble 
a chromothripsis. The outcome of this second chain is 
the swap of partners between the two nonfunctional 
chromosomes 10A*-1B* and 1A-4B, which results in 
two new derivative chromosomes, 10A*-(-lA) and 
(-4B)-1B*. This event is accompanied by fragmentation 
and repair of multiple segments near the junctions from 
the first chromoplexy event (10a, 10b, and 10c; and 4b, 
4c, 4d, 4e, and 4f). 
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derivatives would either be lost or undergo additional 
rearrangement. It is therefore possible that unstable chro- 
mosomes generated by the original chain of translocations 
(chromoplexy) could result in subsequent catastrophes 
that add chromothripsis on top of the original transloca- 
tions. This speculative model, which we term transloca- 
tion-induced chromothripsis, is one potential mechanism 
for chromothripsis involving multiple chromosomes. 

Mechanisms for complex genome rearrangements 

The discovery of chromothripsis, chromoanasynthesis, 
and chromoplexy suggests the existence of novel pro- 
cesses impacting genome architecture. The underlying 
mechanisms are likely to be different for each phenom- 
enon. However, any mechanisms need to explain the 
common theme of single-event-mediated extensive dam- 
age and, for chromothripsis and chromoanasynthesis, the 
fact that the damage is highly localized. Little is known 
about these underlying mechanisms, but we briefly re- 
view current thinking below. 

For chromoanasynthesis, the sequence signatures at 
translocation junctions strongly implicate replication- 
based mechanisms (Liu et al. 2011, 2012). The fact that 
chromoanasynthesis is characterized by apparent tem- 
plate insertions from nearby genomic regions, often con- 
tains microhomology at the breakpoint junctions, and 
lacks the sequence signatures of DSBs strongly implicates 
replication template-switching mechanisms. These mech- 
anisms have been termed fork stalling and template 
switching (FoSTeS) and MMBIR (Slack et al. 2006; Lee 
et al. 2007; Carvalho et al. 2009; Hastings et al. 2009; 
Zhang et al. 2009a,b). Best characterized in yeast (for 
recent reviews, see Anand et al. 2013; Lambert and Carr 
2013), these mechanisms can account for duplications 
(Narayanan et al. 2006; Hicks et al. 2010) and fold-back 
inversions (Mizuno et al. 2013). The template switches can 
arise by recombination involving invasion of homologous 
sequences or the use of microhomologies to prime ectopic 
DNA synthesis. Also consistent with a role for replication 
defects in this type of errors, replication stress induced by 
hydroxyurea or aphidicolin can induce copy number varia- 
tion in human cells (Arlt et al. 2009, 2012). Replication 
defects also can explain the regional bias for the abnormal- 
ities of chromoanasynthesis: There is evidence that break- 
point ends of copy number abnormalities tend to be in 
physical proximity (Lieberman-Aiden et al. 2009; De and 
Michor 2011; Fudenberg et al. 2011); hence, different 
template-switching events could have occurred in loosely 
defined nuclear territories. Nevertheless, without direct 
experimental recapitulation of chromoanasynthesis, the 
inferred models for the mechanism, although supported 
by strong indirect evidence, remain provisional (Currall 
et al. 2013; Kloosterman and Cuppen 2013). 

For chromoplexy, no mechanistic information is cur- 
rently available. It is clear that many breaks must be 
present around the same time for chromoplexy to occur. 
Thus any event that can induce multiple breaks, such as 
DNA replication stress (Donley and Thayer 2013) or active 
transcription (for review, see Kim and Jinks-Robertson 



2012) , could lead to concurrent DSBs in multiple chro- 
mosomes. Chained translocations to repair these breaks 
could then occur by chance, with selection acting to 
preserve copy number balance and oncogene activation at 
the breakpoint junctions. It is also possible that chained 
translocations are promoted by physical clustering of the 
breakpoints in the nucleus (Roix et al. 2003; Meaburn 
et al. 2007; Misteli 2010; Alt et al. 2013). This could occur 
by clustering of commonly regulated transcriptional 
sites or other clustering events, such as the assembly of 
DNA replication factories. In the case of prostate cancer, 
colocalization of transcriptional sites could be mediated 
by colocalization of androgen-responsive elements (Lin 
et al. 2009), which could cluster topoisomerase II-medi- 
ated DSBs (Haffner et al. 2010). 

The origin of chromothripsis is also unclear, but plausi- 
ble hypotheses have been recently advanced (Forment 
et al. 2012; Holland and Cleveland 2012; Jones and 
Jallepalli 2012; Kloosterman and Cuppen 2013). The major 
puzzle is how a large number of DSBs can be so highly 
localized. One proposal is that shattering occurs during 
mitosis when the chromosomes are highly compacted. 
At this point in the cell cycle, it is conceivable that 
external damage such as from ionizing radiation could 
slice through multiple segments of a single chromosome 
(Stephens et al. 2011). Critical telomere shortening lead- 
ing to BFB cycles has also been considered because the 
induced rearrangements are local and chromothripsis is 
often observed near the telomeres (Stephens et al. 2011). 
However, disfavoring this explanation is the fact that BFB 
cycles can result in gene amplification and should pro- 
duce a bias for inverted rearrangements. 

One simple solution to generate massive local muta- 
genesis is when one or two chromosomes (or an isolated 
acentric chromosome arm) are physically isolated from 
the other chromosomes. A simple and appealing hypoth- 
esis for how this could occur with fairly high frequency in 
human cancers is the partitioning of whole chromosomes 
or acentric chromosome fragments into micronuclei 
(Fig. 6; Crasta et al. 2012). It has become apparent that 
chromosomes in micronuclei can be subject to massive 
spontaneous DNA damage (Terradas et al. 2009; Forment 
et al. 2012; Holland and Cleveland 2012), and this damage 
occurs in the first cell cycle after micronuclei are formed 
(Crasta et al. 2012). Importantly, damaged micronuclei 
can be reincorporated into the main nucleus during sub- 
sequent cell division (Crasta et al. 2012; Hatch et al. 

2013) . This means that damaged micronuclei can, at least 
in principle, generate mutations that can be integrated 
back into the genome. 

Micronuclei are structurally similar to regular nuclei 
in that they have a double-membrane structure and 
nuclear pores but contain only one or a few chromosomes 
or chromosomal segments. Spatial isolation of individual 
chromosomes in micronuclei provides a natural setup for 
localized chromosomal damage, but how does damage 
happen? 

One possible mechanism is that newly generated mi- 
cronuclei acquire their DNA damage as they progress 
into the S phase and G2 phase of the subsequent cell cycle 
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Figure 6. Extensive DNA damage of chromosomes in micro- 
nuclei (MN). Micronuclei can form by missegregation of whole 
chromosomes after mitotic errors. Pictured here is a "merotelic" 
kinetochore attachment where a single intact chromatid is 
caught in a tug-of-war between microtubules from opposite poles. 
Micronuclei can also originate from chromosome breaks, gen- 
erating acentric chromosome fragments, or be generated from 
chromosome bridges by a mechanism that has not been de- 
termined (Hoffelder et al. 2004). (Left) Newly generated micro- 
nuclei can undergo irreversible loss of nuclear membrane integrity 
(Hatch et al. 2013). If this occurs during S phase, chromosomes in 
micronuclei will acquire massive DNA damage. {Right) Alter- 
natively, late-replicating chromosomes in micronuclei may un- 
dergo DNA damage if the cell enters mitosis before micronuclear 
replication is complete (Crasta et al. 2012), subjecting the chromo- 
some to premature chromosome compaction (PCC). The extent 
and mechanism of DNA damage during PCC remains to be 
established. During the subsequent cell division cycle, the 
micronuclear genome can be reincorporated into the primary 
nucleus, remain isolated in a micronucleus, or possibly be lost 
due to autophagy or extrusion. 



(Crasta et al. 2012; Hatch et al. 2013). Micronuclei 
replicate their DNA asynchronously with the primary 
nucleus, with many micronuclei showing DNA replica- 
tion in G2 (Fig. 6; Crasta et al. 2012). This delayed rep- 
lication means that some micronuclei might enter mitosis 
and undergo premature chromosome compaction (PCC) 
(Obe and Beek 1975) before they have completed DNA 
replication (Fig. 6; Donley and Thayer 2013). PCC is 
believed to cause DNA damage, as evidenced by gaps 
observed on chromosome spreads (Kato and Sandberg 1967; 
Johnson and Rao 1970) and the fact that PCC can induce 
breakage at chromosome fragile sites (Smith et al. 2001; El 
Achkar et al. 2005; Ravi et al. 2013). Indeed, our group has 
found that in chromosome spreads, late-replicating micro- 
nuclear DNA develops the pulverized appearance that is 
characteristic of PCC (Crasta et al. 2012). 

A recent study from Hetzer's laboratory (Hatch et al. 
2013) has provided important new insight into additional 



defects in micronuclei that could cause DNA damage. 
Using live-cell imaging, the group had previously shown 
that cancer cell nuclei occasionally undergo spontaneous 
loss of nuclear integrity (as assayed by cytoplasmic 
spilling of a nuclear green fluorescent protein) that is 
reversible (Vargas et al. 2012). They have now reported 
that many micronuclei undergo irreversible loss of nu- 
clear envelope (NE) integrity, as evidenced by fluorescent 
imaging and electron microscopy. The basis for the defect 
in NE integrity is still not clear, but it is correlated with 
abnormal and asymmetric nuclear lamina assembly. Most 
strikingly, the rupture of micronuclei is tightly correlated 
with the appearance of massive damage (Fig. 6). Although 
it has not been possible to directly image the timing of 
damage acquisition relative to the timing of rupture, fixed 
cell experiments strongly suggest that damage occurs close 
to the time of rupture. 

How NE rupture contributes to DNA damage is not 
known. We found that micronuclei in serum-starved cells 
readily undergo rupture but acquire little or no damage (A 
Spektor and D Pellman, unpubl.). Thus, rupture seems to 
be required for damage but not sufficient to generate 
damage. One possibility would be that NE rupture during 
DNA replication would abruptly dilute replication fac- 
tors and cause DNA replication fork stalling and thus 
generate damage. Alternatively, rupture during S phase 
might enable enzymes such as endonucleases or topoisom- 
erases to act aberrantly on micronuclear chromosomes. 

Conclusion 

The discovery of chromothripsis by Stephens et al. (201 1) 
opened a floodgate of genome sequencing studies of 
complex, potentially all-at-once genome rearrangement. 
This has led to the identification of chromothripsis in 
many cancer types and in some congenital diseases. The 
flurry of genome sequencing has also uncovered other 
classes of complex rearrangements (chromoanasyn thesis 
and chromoplexy), which also appear to originate from 
single catastrophic events. These discoveries have ex- 
panded our view of the mechanisms by which genomes 
can evolve. Although we have no evidence at present that 
these mechanisms contribute to speciation, this is cer- 
tainly plausible. 

These unexpected discoveries were the fruits of new 
technical advances, particularly next-generation se- 
quencing and the accompanying novel bioinformatics 
tools. The recent studies illustrate the fact that these 
events are more common than could have been expected, 
and further sequencing efforts will clarify their true 
frequency. However, as discussed above, the phenomena 
identified to date could be the extreme examples of even 
more frequent but less dramatic events. Thus, under- 
standing the underlying mechanism is not only interest- 
ing in its own right but may impact our understanding of 
how common these events really are. 

In this review, we tried to illustrate the importance, 
power, and limitations of bioinformatic and computa- 
tional approaches to reconstruct the history of complex 
chromosomal rearrangements (Stephens et al. 2011; 
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Greenman et al. 2012a ; Baca et al. 2013). Although 
reconstructing history from a snapshot of the genome is 
a challenging task, we hope that we conveyed the tremen- 
dous progress toward solving this open problem. 

The understanding of catastrophic chromosomal rear- 
rangements is certainly incomplete until the biological 
mechanisms underlying these phenomena are defined. 
Indeed, one exciting fallout from the genome sequencing 
efforts is the potential to discover new biological mech- 
anisms. For example, if it can be shown that damage 
occurring in micronuclei can lead to chromothripsis, this 
could provide an important example of the emerging role 
of nuclear architecture and subnuclear organization in 
maintaining the stability of the genome (Meaburn et al. 
2007; Misteli and Soutoglou 2009; Misteli 2010; Alt et al. 
2013). Ultimately, linking postulated biological mecha- 
nisms to these new classes of genome rearrangement will 
require an experimental recapitulation of the phenomena 
in the laboratory. As the physicist Richard Feynman 
famously said, "What I cannot create, I do not under- 
stand." Combining experimental live-cell imaging ap- 
proaches with emerging single-cell genome sequencing 
technologies (Shapiro et al. 2013) may be one way to recreate 
and thus understand complex chromosomal rearrange- 
ments. Thus, the use of genome sequencing as a hypoth- 
esis-generating discovery tool may be ushering in a new 
era where genome sequencing is harnessed for hypothesis- 
testing experimental biology. 
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